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Many long noncoding RNA (IncRNA) species have been identified in gametes. However, the biogenesis and 
function of other categories of IncRNAs in gametes is poorly understood. Here, we profiled the expression of 
IncRNAs and mRNAs in spermatogonial stem cells (SSC), type A spermatogonia (A), pachytene 
spermatocytes (PS) and round spermatids (RS) by microarray analysis. We analyze the total expression of 
IncRNA/mRNA in these four germ cells and found that the maximum number of IncRNAs expression is in 
A (22127), and the minimum is in PS (14456). Also, the maximum number of mRNAs is in A (19923), and 
the minimum is in PS (13941). Furthermore, the trend in the number of specific IncRNAs was similar to the 
number of specific mRNAs in each type of germ cells (e.g., maximum in A and minimum in PS). The trend 
in the number of IncRNAs was similar to the number of mRNAs in two continued types of germ cells (e.g., 
maximum in SSC to A and minimum in PS to RS). The correlation analysis showed a high correlation 
coefficient of IncRNAs/mRNAs expression (R = 0.992). The results suggested that the sequential expression 
of long noncoding RNA as mRNA gene expression exhibits coordinated changes in male spermatogenesis. 



Spermatogenesis in testis is a coordinated process, which includes spermatogonial stem cells divided into 
spermatogonia and diploid spermatogonia differentiated into meiotic spermatocytes, which divide twice 
without additional DNA replication, producing haploid round spermatids 1 " 3 . Many genes and intricate 
gene regulation are involved as executors, which contribute to the complex nature of spermatogenesis. As 
spermatogenesis proceeds, the changing amounts and populations of mRNAs in the germ cells have been well 
documented by microarray analyses 3 " 7 . 

Meanwhile, spermatogenesis implies intricate gene regulation at both transcription and post-transcription 
levels. For example, a large number of non-coding transcripts including antisense RNAs, microRNAs and 
piRNAs are highly expressed in mammalian testis 8 " 10 . The transcription of noncoding RNAs of the mammalian 
genome is a larger portion than mRNAs 11 , some of which play important roles in cellular regulation, develop- 
ment, and disease 12 . The long noncoding RNAs (IncRNAs) are of particular interest because they are known to 
contribute to gene silencing 13 , X-inactivation 14 , imprinting 1516 , and development 1718 . However, little is known 
about the biogenesis and function of IncRNAs in gametes, and the transcription and function of IncRNAs in 
spermatogenesis. Therefore, it is important and meaningful to investigate the transcription and function of 
IncRNAs in spermatogenesis, which will contribute in the fields of reproduction and development. 

Here, we investigated the expression profiling of IncRNA/mRNA in specific stage cells, which included 
spermatogonial stem cells (SSC), type A spermatogonia (A), pachytene spermatocytes (PS), and round sperma- 
tids (RS), of mouse spermatogenesis by microarray analysis. We found that the maximum number of IncRNAs 
was in A, and the minimum was in PS, which is matched with the number of mRNAs in specific stages of mouse 
spermatogenesis. The trend in the number of specific/continued IncRNAs expression was similar to the number 
of specific/continued mRNA expression in mouse germ cells. The results suggest that the transcription of 
IncRNA/mRNA exhibits coordinated changes in spermatogenesis in males, which suggests that the IncRNAs 
may regulate the intricate gene at both transcription and post-transcription levels. It will promote the research of 
IncRNAs' functions in reproduction, development, and other fields in the future. 
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Results 

Expression profile of IncRNAs and mRNAs in mouse spermato- 
genesis. To profile IncRNA and mRNA expression in mouse 
spermatogenesis, SSC, A, PS, and RS were isolated and evaluated 
by morphology (Figure 1A) and germ cell-type-specific marker 
gene expression (Gfroc-1 for SSC, c-Kit for A, Scp3 for PS, and 
Prm2 for RS) (Figure IB). After purity identification (>90%), the 
mouse LncRNA Array v2.0 (8 X 60 K, Arraystar) was used to analyze 
the expression profile of IncRNAs and mRNAs in the four types of 
germ cells (Figure 2A). A total of 24316 IncRNAs and 21056 mRNAs 
were found by the microarray analysis (Figure 2A), and the primary 
data are presented in Supplemental Table S1A/S1B. We divided 
the IncRNAs/mRNAs profile into three groups: specific expression 
in each type of germ cells (e.g., 241 IncRNAs in SSC), expression 
in two continued types of germ cells (e.g., 17354 IncRNAs in SSC 
to A) and expression in all types of germ cells (11741 IncRNAs) 
(Figure 2A). To confirm the microarray analysis results, real-time 
quantitative PCR (Figure 2B and Supplemental Figure SI) was 
carried out to examine 10 representative IncRNAs (uc007ens.l, 
uc009coa.l, AK007004, uc007pzb.l, AK158457, AK163877, uc007ibs.l, 
Xist, AK090109, ENSMUST00000 118049). Real-time quantitative 
PCR confirmed microarray analysis results: uc007ens.l and 
AK007004 were highly expressed in SSC, and uc009coa.l and Xist 
was highly expressed in PS and RS. Xist RNA directs chromatin and 
transcriptional change by binding Polycomb repressive complex 2 
(PRC2), the epigenetic complex responsible for trimethylation 
of histone H3 at lysine 27 (H3K27me3), and by targeting PRC2 
to the X-inactivation, and X- reactivation cycle during mouse 
development 1920 . 

Chromosomal localization and classifications of IncRNAs and 
mRNAs. We obtained 24316 InRNAs and 21056 mRNAs in total 
from the microarray. According to the transcriptions of IncRNAs 
and mRNAs of which chromosome, overall, the IncRNAs and 
mRNAs are distributed on all of the chromosomes. About one- 
thousand IncRNAs or mRNAs were transcribed from each chromo- 
some, except the Y-chromosome (30 IncRNAs and 34 mRNAs in 
Y-chromosome) (Figure 3A). Furthermore, the trend in the number 
of IncRNAs was similar to mRNAs in each type of germ cells (e.g., 



maximum in A and minimum in PS) (Figure 3B and Figure 4), and 
the primary data are presented in Supplemental Table S2A/S2B. 
These results indicated that transcription of IncRNA genes 
coordinated with the transcription of protein-coding genes in 
mouse male germ cells. Consistent with previous studies 21,22 , 
IncRNAs expression exhibited temporal and spatial specificity as 
mRNAs' temporal and spatial expression in male spermatogenesis. 

In addition, based on the relationship between IncRNAs and 
nearby coding genes, IncRNAs were classified into four groups: sense 
overlap IncRNA (the LncRNA exon is overlapping a coding tran- 
script exon on the same genomic strand); antisense IncRNA (the 
LncRNA is transcribed from the antisense strand and overlapping 
with a coding transcript); bidirectional IncRNA (the LncRNA is 
oriented head to head to a coding transcript within 1000 bp); and 
intergenic IncRNA (there are no overlapping or bidirectional coding 
transcripts nearby the LncRNA) 22 . The intergenic IncRNAs consti- 
tuted the majority of total IncRNAs, and the number of intergenic 
IncRNAs changed dramatically in each germ cell (Figure 3C). Other 
groups of IncRNA constituted the minority of total IncRNAs, and the 
number of each group changed little in each germ cell (Figure 3C), 
which is in accordance with other published results 22 . These results 
suggested that intergenic IncRNAs may act as tissue- specific genes 
(actively expressed) and another group of IncRNA as housekeeping 
genes in spermatogenesis. 

Specific expression and continued expression of IncRNAs and 
mRNAs in mouse germ cells. In order to further explore the 
relationship between IncRNAs and mRNAs in mouse spermatogene- 
sis, we next analyzed specific expression of IncRNAs/mRNAs in each 
type of germ cells and expression of IncRNAs/mRNAs in two 
continued types of germ cells. The trend in the number of specific 
IncRNAs was similar to the number of specific mRNAs in each type 
of germ cell (e.g., maximum (3639 IncRNAs and 2738 mRNAs) in A 
and minimum (98 IncRNAs and 27 mRNAs) in PS) (Figure 4), and 
the primary data are presented in Supplemental Table S3A/S3B. The 
clusters of heat maps directly display this trend (Supplemental Figure 
S2). As is known, the postnatal development of male germ cells can 
be divided into three phases: mitotic, meiotic, and haploid 23 . In the 
mitotic phase, prospermatogonia differentiate into differentiated 
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Figure 1 | Four types of male germ cells used to construct the IncRNA/mRNA libraries. (A), Representative photographs of SSC, A, PS, and RS 
after isolation were shown. (B), Expression levels of the male germ cell specific marker genes transcripts (Gfra-1 for SSC, c-Kit for A, Scp3 for PS, and 
Prm2 for RS) were determined by real-time PCR. Data shown are the mean ± SEM of three separate experiments performed in triplicate. 
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Figure 2 | Schematic summary of IncRNAs/mRNAs expression profile analysis and validation of representative IncRNA expression. (A), Schematic 
overview of the experimental design for IncRNAs/mRNAs microarray analysis. (B), Total RNA was extracted from isolated male germ cells and then 
subjected to real-time PCR analysis for IncRNAs (uc007ens.l, uc009coa.l, AK007004, uc007pzb.l, AK158457, AK163877, uc007ibs.l, Xist, AK090109, 
ENSMUST00000118049) expression. These IncRNAs expression levels were normalized to endogenous GAPDH mRNA. Data shown are the mean ± 
SEM of three separate experiments performed in triplicate. 




spermatogonia that undergo multiple rounds of mitotic cell divisions 
before entering the meiotic phase and becoming spermatocytes. 
During meiosis, homologous chromosomes pair and cross over 
through recombination. Most of the testicular mRNAs were 
transcripted in A spermatogonia. Fewer mRNAs were transcribed 
in pachytene spermatocytes, a finding that has been confirmed by 
previous research 3,4 . To date, most of the testicular mRNAs known to 
be translationally regulated are initially transcribed in postmeiotic 
cells. Hence, more IncRNAs were transcripted in premeiotic cells 
before the mRNAs translation, which might regulate the proteins' 
function. The results implied a potential involvement of IncRNAs in 



the regulation of meiosis and spermatid differentiation during 
postnatal testicular development and spermatogenesis 22 . 

In addition, the trend in the number of IncRNAs was similar to the 
number of mRNAs in two continued types of germ cells (e.g., max- 
imum (17354 IncRNAs and 15247 mRNAs) in SSC to A and min- 
imum (12459 IncRNAs and 12840 mRNAs) in PS to RS) (Figure 4). 
Then, we analyzed all IncRNAs that displayed significant up- or 
down-regulation (> 5 -fold change) in two continued types of germ 
cells (SSC to A, A to PS, and PS to RS). By mapping IncRNAs and 
mRNAs that were significantly regulated and <30 kb in length, we 
identified numerous correlations between IncRNAs and mRNAs, 
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Figure 3 | Chromosomal distribution of the IncRNAs/mRNAs and classification of IncRNAs. (A), The number of IncRNAs/mRNAs localized on each 
chromosome. (B), The total number of IncRNAs/mRNAs in each type of male germ cells. (C), The number of four types of IncRNAs (sense, antisense, 
bidirectional and intergenic IncRNAs) existed in each type of male germ cells. 



and the data are presented in Supplemental Table S4. Next, gene 
ontology (GO) analysis showed that these differentially expressed 
mRNAs of all two continued types of germ cells are involved in 
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Figure 4 | Coordinated changes in transcription between IncRNAs and 
mRNAs. The number of IncRNAs and mRNAs in different classification 
groups of mouse male germ cells was listed. 



spermatogenesis (GO:0007283), male gamete generation (GO: 
0048232), sexual reproduction (GO:0019953), gamete generation 
(GO:0007276), multicellular organismal reproductive process (GO: 
0048609), reproduction (GO:0000003), etc. (Supplemental Figure S3). 

Coordinated transcription of IncRNAs and mRNAs gene pairs in 
mouse spermatogenesis. To conclude the regular pattern of IncRNAs/ 
mRNAs expression, the data from IncRNAs/mRNAs expression in 
different classification groups of mouse male germ cells was listed in 
Figure 4. With the correlation analysis of the IncRNAs/mRNAs 
expression trends, it showed a high correlation coefficient of IncRNAs/ 
mRNAs expression (R = 0.992) indicated that transcribed IncRNAs/ 
mRNAs gene pairs may be coordinately regulated during mouse 
spermatogenesis. To date, there were no reports that IncRNA expres- 
sion was related to mRNA expression. However, some research 
reported that a large fraction of these transcripts (>60%) origi- 
nated from divergent transcription at promoters of active protein - 
coding genes 22 . The divergently transcribed IncRNA/mRNA gene 
pairs exhibit coordinated changes in transcription when embryonic 
stem cells are differentiated into endoderm 22 . The results reveal 
that the transcription of most IncRNA genes is coordinated 
with the transcription of protein- coding genes during mouse 
spermatogenesis. 
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Discussion 

LncRNAs are thought to play important roles in testicular develop- 
ment and spermatogenesis 22 ' 23 . Using a microarray assay, we further 
analyzed the expression profiles of IncRNAs and mRNAs in four 
critical types of germ cells during mouse spermatogenesis, which 
included SSC, A, PS, and RS. 

SSCs are undifferentiated male germ cells that balance self- renew- 
ing and differentiating divisions to maintain spermatogenesis 
throughout adulthood 24 . Several studies have demonstrated that 
the key roles of some proteins and microRNAs are to maintain 
SSC survival and self- renewal 25 " 27 . In addition, IncRNAs are involved 
in the regulation of protein-coding genes expression and have bio- 
logical function as 'decoys' for miRNAs to regulate their targeting 
genes 28 ' 29 . In the present study, 241 specific IncRNAs was identified in 
SSCs, suggesting that these IncRNAs may play critical roles in main- 
taining SSC survival and self-renewal through protein-coding genes 
and microRNAs. There were also some specific IncRNAs in A, PS, 
and RS, which may participate in the regulation of meiosis and 
differentiation of mouse germ cells. 

Spermatogenesis is a continuous process. It is thus important that 
there should exist some regulators in all types of germ cells for giving 
impetus to progress of spermatogenesis. For example, the expanding 
family of CREB/CREM transcription factors is key molecular regu- 
lators at all stages of spermatogenesis 30 . In our study, 1 1741 IncRNAs 
were identified in all types of male germ cells, suggesting that these 
IncRNAs may function as housekeeping genes that affect the entire 
process of spermatogenesis. In addition, expression of IncRNAs in 
two continued types of germ cell types was examined, and they may 
regulate differentiation of specific stages during mouse spermatogen- 
esis. Bao et al. investigated expression profiling of both IncRNAs and 
mRNAs in the male germline progresses 31 . They focused on thou- 
sands of IncRNAs and hundreds of lincRNAs that are either up- or 
down- regulated during male germ cell development. They found that 
the highly regulated IncRNAs were correlated with nearby (<30 kb) 
mRNA gene clusters, which were also significantly up- or down- 
regulated, and identified IncRNAs associated with miRNA and 
piRNA clusters. The predicted function of IncRNAs were based on 
their location nearby the gene, miRNAs, and piRNAs 31 . Sun et al. 
examined IncRNA expression profiles of neonatal (6-day-old) and 
adult (8- week-old) mouse testes and found 3025 were differentially 
expressed during post-natal testis development 21 . Most differentially 
expressed IncRNAs exhibited epigenetic modification marks similar 
to protein-coding genes and tended to be expressed in a tissue- 
specific manner 21 . These previous studies mainly focused on the 
up- and down- expression of IncRNAs during spermatogenesis, or 
according to the nearby location of IncRNAs with coding genes to 
predict some IncRNAs function. However, Sigova et al. reported that 
the majority of IncRNAs in human and murine embryonic stem cells 
are produced from divergently transcribed protein- coding genes and 
that the divergently transcribed IncRNA/mRNA gene pairs exhibit 
coordinated changes in transcription when embryonic stem cells are 
differentiated into endoderm 32 . In this study, the trend in the number 
of IncRNAs was similar to the number of mRNAs in each type of 
germ cells and two continued types of germ cells, and the trend in the 
number of specific IncRNAs was similar to the number of specific 
mRNAs in each type of germ cell. Furthermore, these data showed a 
high correlation coefficient of IncRNAs/mRNAs expression, suggest- 
ing that total IncRNAs/mRNAs also exhibit coordinated changes in 
transcription. We analyzed the overall levels of IncRNAs and 
mRNAs expression during spermatogenesis; the results also indi- 
cated that intergenic IncRNAs may act as tissue-specific genes (act- 
ively expressed), and sense overlap IncRNA, antisense IncRNA, 
bidirectional IncRNA as housekeeping genes in spermatogenesis. 
Thus, we think that analysis of the up- and down-regulated inter- 
genic IncRNAs in previous studies 21 ' 31,33 may be reliable. However, 
these methods ignored the unchanged IncRNAs (sense overlap 



IncRNA, antisense IncRNA, and bidirectional IncRNA) between 
two germ cells, which may be important during spermatogenesis. 

In summary, our study here defined the transcription mechanism 
of IncRNAs/mRNAs in mouse male germ cells, a process that may 
contribute to regulating meiosis and differentiation during mouse 
spermatogenesis. Further understanding of coordinated changes in 
transcription of total IncRNAs/mRNAs will provide new clues to 
diagnose and treat reproductive disorders. 

Methods 

Animals and germ cell isolation. C57BL/6J male mice were originally purchased 
from Vital River Laboratories in Beijing, China. Spermatogonia stem cell (SSC) were 
isolated from 6-days postpartum (dpp) using enzymatic digestion and magnetic- 
activated cell sorting 34 . Type A spermatogonia (A) were isolated from 8-dpp using 
enzymatic digestion and differential plating 34 . Pachytene spermatocytes (PS) and 
round spermatids (RS) were isolated from 17-dpp and adult mice using the unit 
gravity sedimentation procedure as described previously 10 ' 35 ' 36 . Thirty-five, twenty- 
five, fifteen, and five mice of corresponding ages were used to isolate SSC, A, PS, and 
RS, respectively. The purity of the four germ cell types was >90% based on 
morphological evaluation and was verified by real-time quantitative PCR detection of 
germ cell type-specific marker genes. All thees experiments on live vertebrates were 
performed in accordance with the relevant guidelines and regulations. This study 
received ethical approval from the institutional review boards of the International 
Peace Maternity & Child Health Hospital, School of Medicine, Shanghai Jiaotong 
University. 

LncRNA and mRNA microarray analysis. LncRNA and mRNA expression profiles 
of SSC, A, PS, and RS were generated by applying the Agilent Array platform (Palo 
Alto, CA). All procedures were carried out according to manufacturer's standard 
protocols. Briefly, total RNA was extracted from each sample of germ cells using 
Trizol (Invitrogen, Carlsbad, CA), and functional RNA was purified from total RNA 
after removal of rRNA (mRNA-ONLY™ Eukaryotic mRNA Isolation Kit, Epicentre, 
Madison, WI) according to the manufacturer's instructions. Then, each sample was 
amplified and transcribed into fluorescent cRNA along the entire length of the 
transcripts without 3 ' bias utilizing a random priming method, and the labeled 
cRNAs were purified by RNeasy Mini Kit (Qiagen, Valencia, CA). The concentration 
and specific activity of the labeled cRNAs (pmol Cy3/ ug cRNA) were measured with a 
NanoDrop ND-1000 Spectrophotometer (Nano-Drop Technologies, Wilmington, 
DE). The labeled cRNAs were hybridized onto the Mouse LncRNA Array v2.0 (8 X 
60 K, Arraystar). This microarray contains 31,423 IncRNA probes collected from 
RefSeq_NR, UCSC_knowngenes, NRED, Fantom 3.0, and 25,376 mRNA probes. The 
IncRNA probes involve all 20 pairs of chromosome and mitochondrial genome. Each 
transcript is represented by a specific exon or splice junction probe which can identify 
individual transcript accurately. The probes were synthesized by Agilent technology, 
which made the probes was stable, uniform, reliable. Also, the numbers of each probe 
were equal and the quantities of the probes were the same. After having washed the 
slides, the arrays were scanned by the Agilent Scanner G2505C, and the acquired 
array images were analyzed by Agilent Feature Extraction software (version 11.0.1.1). 
Quantile normalization and subsequent data processing were performed using the 
GeneSpring GX vl 1.5.1 software package (Agilent Technologies). After quantile 
normalization of the raw data, IncRNAs or mRNAs were all detected in each group, 3 
samples have flags in Present or Marginal ("All Targets Value"), were chosen for 
further data analysis. Differentially expressed IncRNAs and mRNAs with statistical 
significance between two groups of germ cells were identified by Volcano Plot 
filtering with a threshold of fold-changes > 2 and p-values < 0.05. Pathway analysis 
and GO analysis were applied to study the roles of these differentially expressed 
mRNAs played in these biological pathways or GO terms. Finally, Hierarchical 
Clustering was performed to show the distinguishable LncRNAs and mRNAs 
expression patterns among the germ cells. 

RNA extraction and real-time quantitative PCR. Total RNA of mouse germ cells 
was extracted with Trizol (Invitrogen, Carlsbad, CA). RNA was reverse transcribed 
into cDNA using a PrimeScript RT reagent kit (TaKaRa Bio, Inc., Otsu, Japan) 
according to the manufacturer's instructions. Real-time PCR was performed on an 
ABI Step One System (Applied Biosystems, Foster City, CA) using a SYBR Premix Ex 
Taq II kit (TaKaRa Bio, Inc.), as described previously 37 ' 38 . The relative expression 
levels of IncRNAs (uc007ens.l, uc009coa.l, AK007004, uc007pzb.l, AK158457, 
AK163877, uc007ibs.l, Xist, AK090109, ENSMUST00000 118049) were calculated 
using the comparative 2 — AACt method 39 and were normalized to endogenous 
GAPDH mRNA and 18S rRNA. The primer sequences for mouse Gfrod, cKit and 
GAPDH were previously described 40 and for mouse Prm2 41 . Primers for uc007ens.l, 
uc009coa.l, AK007004, uc007pzb.l, AK158457, AK163877, uc007ibs.l, AK090109, 
ENSMUST00000 118049, Scp3 and 18S rRNA are listed in Supplemental Table S5. 

Statistical analysis. Experiments were repeated at least three times and the values are 
presented as means ± SEM. Means of groups were analyzed by Student's t-test or the 
ANOVA test when appropriate. P-values of less than 0.05 were considered to be 
statistically significant. 
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